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ABSTRACT 



A method for coding an input video signal with a field rate 
of 60 Hz derived from a motion picture film source using 
2-3 pulldown. In the method, duplicate fields are detected in 
the input video signal. Each duplicate field is eliminated 
from the input video signal to produce a progressive video 
signal comprising plural frames with a frame rate of 24 Hz. 
Finally, the progressive video signal is coded to produce a 
coded video signal. Preferably, when a duplicate field is 
detected in the input video signal, a control signal is gen- 
erated in response to each detected duplicated field. Each 
control signal is then included in the coded video signal.. 

25 Claims, 18 Drawing Sheets 



FMC 



VI- 



(60HzVID^^^Y»t 
FIELD [CIRCUIT 

SIGNAL) T 



2-3 

PULL-DOWN 
DETECTION 



102 



MACRO B LOCK ADDRESS 
I04x 



RATE 
HCONVERSION 
CIRCUIT 

i — c 



PICTURE_C0DIN6-TYPE 



£ 



FIELD ORDER 
J RE- 
ARRANGING 
CIRCUIT 



106 



- CODER 



F 



103 TOP. FIELD- FIRST 

NUMBER -OF- FIELD- DISPLAYED- CODE 



105 



100 CODING DEVICE 



1 



E C 
C I 
C R 
C 
U 



CORDING 



MODULATION 
CIRCUIT 



M07 



108 



RECORDING 
MEDIUM 



M09 1 



NUMBER -OF. FIELD- DISPLAYED- CODE 



DEMODULATION 
CIRCUIT 



w no 



ECC 

DECODING 
CIRCUIT 



III 



TOP. FIELD- FIRST 
1 



DECODER 



112 



RATE 

CONVERSION 
CIRCUIT 



Vo 



T 



TEMPORAL- REFERENCE 



VO 

(60 Hz VIDEO 
FIELD SIGNAL) 

"113 



101 



ADDRESS 
DECODING DEVICE 



Exhibit 22, page 1 



U.S. Patent Oct 24, 1995 Sheet 1 of 18 



5,461,420 




Exhibit 22, page 2 



U.S. Patent Oct 24, 1995 Sheet 2 of 18 



5,461,420 




Exhibit 22, page 3 



U.S. Patent 



Oct 24, 1995 



Sheet 3 of 18 



5,461,420 



ro 
6 




o 
u. 



01 
UJ 

Q O 

or z 

o oh 

3 <3 

U.0C<O 



o 



Z 
O 

moo 



8 



i6 
8i=i= 

Tdfcj* 

CVJQ.OO 



CM 
O 



"~JTT 



> ^ 



g 

>Q< 
N_]Z 

IUJO 

Su.cn 



UJ 

o 
> 

UJ 
Q 

0 



Q 

8 



8 

hi 

>u_c/> 



q: 
uj 

Q 

o 



yuittrtt 
u_cc<o 



z 

13 



or 



co 

(Et- 
ui =: 

*>3 

[too 



ro 



UJ 
Q 
O 
O 



o 

Ll 



CM 




UJ 

o 
> 

UJ 

p 

© 
z 

Q 
O 

a 

Q 



_ O 



Exhibit 22, page 4 



U.S. Patent 



Oct. 24, 1995 



Sheet 4 of 18 



5,461,420 




Exhibit 22, page 5 



U.S. Patent Oct 24, 1995 Sheet 5 of 18 5,461,420 




I —J t — J 



Exhibit 22, page 6 



U.S. Patent 



Oct 24, 1995 



Sheet 6 of 18 



5,461,420 



O 
li. 

> 

5 



o 
> 

I 



tr 




UJ 
















m 




1 





I 

UJ 

mi 
<l-o: 
— ouj 
q:zq 
<ujo 

>-JO 



to 
o 



CP 

6 

Ll 





o 



CM 



O 



I 

111 

CO 















- \ 








— f 

FIELD 
MEMORY A 


FIELD 
MEMORY B 


FIELD 
MEMORY C 


FIELD 
MEMORY D 























Z> 
O 

CO* 
CO o 
LU 

a: 

CCCD 



p 

o 

Q 
UJ 

q: 

CL 



3& l fc 

5Q.5QO 



UJ 
Q 
O 



O 
LU 



a: 

i 

> 



O 

& 
5 



CO 



Exhibit 22, page 7 



U.S. Patent 



Oct 24, 1995 



Sheet 7 of 18 



5,461,420 



F I G. 7 




Exhibit 22, page 8 



U.S. Patent 



Oct 24, 1995 



Sheet 8 of 18 



5,461,420 



o 



[ 



fe§~a 

ujoo 



ouj_mJOl-Oft: 



00 

6 

Ll. 



CO 

0 



UJ 

31 

UJ LJ 



>- 
ft: 

38 

UJ lD 



6 



o 

3l 

UJ UJ 



_ ft: 

38 

uj lD 



o 



ft:^ 



zz 

>UJ 



woo 



tLftouo-oi-oft: 



■5» 
p 



u>\-, 

C/)<t- 

<oo 



T 



_j 
< 
z 
o 

o 

UJ 

3 



UJ 

u. 



o 
op. 
</><t 

UJ0C3 
QftUJO 

<QZ[£ 

U_|C>UJ — 

o:<oo 



cvj 



UJ 
Q 
O 



O 

Q 
UJ 

q: 
a. 



> 

2 
O 

fe 



> 
2 



Exhibit 22. page 9 



U.S. Patent 



Oct 24, 1995 



Sheet 9 of 18 



5,461,420 



0> 
6 



CD 
O 




UJOO 



CO 
CO 
UJ 

tr 

a 

3 

m 

o 

a 



I 



q: 

o 
O 
o 



ID 

o 



o 



S s 



o 
u_ 



u.a:<o 
I 



o 

CO , 

cc t 

(TOO 



to 
o 



> 

UJ 

o 



Ui 

o 

8 

o 

UJ 

5- 

5 ? 

9 I 

§ SI 

1 

uj 

CD 

5 

.3 
2 



IU 
O 

8 

i 

O 
UJ 

>- 



a. 

CO 

Q 

I 

o 

u. 

u! 
o 

I 

ui 
m 
2 

3 



o 

UJ 

u. 

til 



CVJO.OO 



CM 

o 



b 
> 



§1 

CDU. 



?cdu_ ro 

i — \y~ 



2 
O 

CO 

uj =: 
So* 



cr 
Li 

Q 
O 

& 
Q 



o 
> 



to 

CO 

UJ 
a: 
q 
a 
< 



© 

Q 5 

o8| 

O UJ — 
UJ Q O 



> y 



S 

593 

IliJCD 
O— — 
CDU. CO 




UJ 

o 
> 

UJ 

o 

e> 

2 
O 

§ 



Exhibit 22, page 10 



U.S. Patent Oct 24, 1995 Sheet 10 of 18 5,461,420 



rn 


o 


/-n L m en 


r~ 


CD <fr 

lO to 
U J I* / 


o o 


sj- CVJ 
CM O 




<~ 

O 


- o 



UJ 

o 

D 

8 



0. 

> > 



o 



V) 
I 

D 



Q 

UJq 



F— ZD — 



Exhibit 22, page 1 1 



U.S. Patent 



Oct 24, 1995 



Sheet 11 of 18 



5,461,420 



CD 



to 




s 

Q 



h- 
3 
Q_ 
2 



Exhibit 22, page 12 



U.S. Patent Oct. 24, 1995 sheet 12 of is 5,461,420 




Exhibit 22, page 13 



U.S. Patent Oct 24, 1995 Sheet 13 of 18 



5,461,420 




Exhibit 22, page 14 



U.S. Patent Oct 24, 1995 



Sheet 14 of 18 



5,461,420 



1 1 trv) 




Exhibit 22, page 15 



U.S. Patent Oct 24, 1995 Sheet 15 of 18 5,461,420 



F I G. 15 



FIXED 
VOLUME 



811 

\ 









ENCODER - 








EMPTY 
VOLUME 


OCCUPATION 
VOLUME a 







OCCUPATION 




VOLUME a 




OCCUPATION 




VOLUME b 


t 

i 
I 



802 




RECODING MEDIUM 
803 



EMPTY 
VOLUME 



\ 



OCCUPATION 
VOLUME b 



804 



DECODER 



805 



Exhibit 22, page 16 



U.S. Patent Oct 24, 1995 sheet 16 of is 5,461,420 



ill s 




L__ 



1 

CD 



Exhibit 22, page 17 



U.S. Patent 



Oct 24, 1995 



Sheet 17 of 18 



5,461,420 




Exhibit 22, page 18 




Exhibit 22, page 19 



5,461,- 

1 

APPARATUS FOR CODING AND DECODING 
A DIGITAL VIDEO SIGNAL DERIVED FROM 
A MOTION PICTURE FILM SOURCE 

FIELD OF THE INVENTION 5 

The present invention relates to apparatus for coding and 
decoding a digital video signal with a field rate of 60 Hz 
derived from a motion picture film source with a frame rate 
of 24 Hz. 10 

BACKGROUND OF THE INVENTION 

The Motion Picture Experts Group (MPEG) standard is 
representative of a standard for compressing digital video 
signals for transmission or storage. The standard was dis- 15 
cussed by ISO-IEC/TTC1/SC2/WG11 and has been pro- 
posed as a draft standard. The standard stipulates a hybrid 
compression method, combining motion compensated pre- 
diction coding with discrete cosine transform (DCT) coding. 

The first compression technique, motion compensated 20 
prediction coding, takes advantage of the correlation of 
video signals in the time domain. According to this method, 
the video signal representing the current picture (a frame or 
a field) is predicted from the decoded and reproduced 
(reconstituted) video signal representing a reference picture, 
which is a picture that is earlier or later than the current 
picture. Only the motion prediction errors between the video 
signal representing the current picture and the reconstituted 
video signal representing the reference picture are transmit- 
ted or stored. This significantly reduces the amount of digital 30 
video signal required to represent the current picture. 

The second compression technique, DCT coding, takes 
advantage of the mtra-picture, two-dimensional correlation 
of a video signal. According to this technique, when a block S5 
of the current picture, or a block of motion prediction errors, 
is orthogonally transformed, signal power is concentrated in 
specific frequency components. Consequently, quantizing 
bits need only be allocated to the DCT coefficients in the 
region in which the signal power is concentrated. This 40 
further reduces the quantity of digital video signal required 
to represent the picture. For example, in a region in which 
the image has little detail, and in which the video signal is 
thus highly correlated, the DCT coefficients are concentrated 
at low frequencies. In that case, only the DCT coefficients in 4j 
the low-frequency region of the distribution pattern are 
quantized to reduce the quantity of the digital video signal. 

Because the coding techniques of the MPEG standard are 
basically intended for use with interlaced video signals, 
problems arise when they are applied without modification 50 
to non-interlaced video signals. In particular, the compres- 
sion ratio can be impaired when the MPEG techniques are 
applied to non-interlaced video signals. 

A motion picture consists of a sequence of still pictures 
reproduced in succession, normally 24 pictures per second, ss 
A motion picture film source, e.g., a motion picture film or 
a 24-frame video signal, represents each picture of the 
motion picture as a full frame with a frame rate of 24 Hz, 
whereas an interlaced video signal represents each picture of 
the motion picture as two consecutive fields, each field 60 
representing half of the picture and being displaced from one 
the other by one line. An NTSC interlaced video signal has 
a field rate of 60 Hz. Consequently, deriving an interlaced 
video signal with a field rate of 60 Hz from a motion picture 
film source with a frame rate of 24 Hz, such as is done using 65 
a telecine machine, requires a conversion between the num- 
ber of frames per second of the film source and the number 



2 

of fields per second in the video signal. 

A motion picture film source with a 24 Hz frame rate is 
commonly converted to an interlaced video signal with a 60 
Hz field rate, such as an NTSC video signal, by a technique 
known as 2-3 pull-down. FIG. 1 illustrates how 2-3 puD- 
down works. 

The 2-3 pull-down process involves a repetitive sequence 
of deriving two fields of the video signal from the first of 
every two consecutive frames of the motion picture film 
source, and deriving three fields of the video signal from the 
second of the two consecutive frames of the film source. In 
FIG. 1, frames 800 and 801 are consecutive frames of a 
motion picture film source with a frame rate of 24 Hz. In the 
figure, each film source frame is divided into an odd field, 
indicated by a solid line, and an even field, indicated by a 
broken line. 

First, two fields of the video signal are derived from the 
first film source frame 800. The video field 802, an odd field, 
is first derived from the first film source frame 800, followed 
by the second video field 803, an even field. Then, three 
fields of the video signal are derived from the second film 
source frame 801. The video field 804, an odd field, is first 
derived, followed by the video field 805, an even field, 
followed by the video field 806, another odd field. The two 
odd fields 804 and 806 are identical to one another. This 
process is repeated for the other two film source frames 808 
and 809 from which the video fields 810 through 814 are 
derived. Note that an even field 810 is derived first from the 
film source frame 808, and that two even fields 812 and 814 
are derived from the film source frame 809. With the 
arrangement shown, a sequence of ten fields of the video 
signal is derived from a sequence of four frames of the 
motion picture film source, after which the sequence is 
repeated 

FIG. 2 shows the result of combining into frames con- 
secutive pairs of fields of the interlaced video signal derived 
by the process shown in FIG. 1. The video fields 900 and 901 
are derived from the same film source frame. Video fields 
902 and 903 are also derived from the same film source 
frame. Hence, the video frame 907, produced by combining 
the video fields 900 and 901, and the video frame 908, 
produced by combining the video fields 902 and 903, are 
each derived from the same film source frame. On the other 
hand, the video frame 909, produced by combining the 
consecutive video fields 904 and 905 is derived from two 
different film source flames. 

When MPEG coding is applied to the flames of a non- 
interlaced video signal derived from an interlaced video 
signal, which, in turn, is derived a motion picture film source 
using 2-3 pulldown, coding the flames 907 and 908 in the 
above example presents no problems because these flames 
are each derived from a single film source frame, and are 
thus internally correlated. However, difficulties can be 
encountered when coding the video frame 909 because it is 
derived from two different flames of the film source, and, 
hence, it is not necessarily internally correlated. 

If the motion picture is fast-moving, or if a scene change 
occurs within the frame, a video frame derived from two 
different flames of the film source has low vertical correla- 
tion, which reduces the efficiency of DCT-based signal 
compression. Moreover, motion compensated prediction can 
also go wrong because of the reduced correlation of the 
video signal. 
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OBJECTS AND SUMMARY OF THE 
INVENTION . 

In view of the problems described above, it is an object of 
the present invention to provide a video signal coding 
method and apparatus which allows efficient coding of a 
video signal derived from a motion picture film source using 
2-3 pull-down technique, a corresponding video signal 
decoding method and apparatus, and a recording on which 
a coded recording signal is recorded. 

Accordingly, the invention provides a method for coding 
an input video signal with a field rate of 60 Hz derived from 
a motion picture film source using 2-3 pulldown. In the 
method according to the invention, duplicate fields are 
detected in the input video signal. Each duplicate field is 15 
eliminated from the input video signal to produce a pro- 
gressive video signal comprising plural frames with a frame 
rate of 24 Hz. Finally, the progressive video signal is coded 
to produce a coded video signal. 

Preferably, when a duplicate field is detected in the input 20 
video signal, a control signal is generated in response to each 
detected duplicated field. Each control signal is then 
included in the coded video signal. 

Hie invention also provides a method of decoding a coded 
video signal to provide an interlaced video signal with a field 25 
rate of 60 Hz. The coded video signal is a signal derived by 
coding a progressive video signal having a frame rate of 24 
Hz. The progressive video signal was in turn derived from 
an interlaced video signal with a field rate of 60 Hz by 
eliminating duplicate fields. The coded signal includes a 30 
control signal indicating each frame of the progressive 
signal from which a duplicate field was removed. In the 
method, the coded video signal is decoded to provide the 
progressive video signal. The control signal is extracted 
from the coded video signal. Then, finally, three fields of the 35 
interlaced video signal are derived from certain frames of 
the progressive video signal and two fields of the interlaced 
video signal from the rest of the frames of the progressive 
video signal in response to the control signal. 

Three fields of the interlaced video signal are preferably 40 
derived from those frames of the progressive video signal 
indicated by the control signal as being frames from which 
a duplicate field was eliminated. 

The invention also provides a recording comprising a 
recording medium and a recording signal recorded in the 
recording medium. The recording signal includes a coded 
progressive video signal comprising plural frames from 
which a duplicate field has been eliminated. The recording 
signal also includes a control signal indicating the frames 
from which a duplicate field has been eliminated. 

The invention further provides an apparatus for coding an 
input video signal with a field rate of 60 Hz derived from a 
motion picture film source using 2-3 pulldown. The appa- 
ratus comprises a circuit that detects duplicate fields in the 55 
input video signal and a circuit that eliminates each dupli- 
cate field from the input video signal to provide a progres- 
sive video signal having a frame rate of 24 Hz. Finally, the 
apparatus comprises a coding circuit that codes the progres- 
sive video signal to provide a coded video signal. 6Q 

The detecting circuit may include a circuit that generates 
a control signal in response to each detected duplicated field, 
and the coding circuit may additionally comprise a circuit 
that includes each control signal in the coded video signal. 

The invention yet farther provide an apparatus for decod- 65 
ing a coded video signal to provide an interlaced video 
signal with a field rate of 60 Hz. The coded video signal is 



45 



50 



a signal derived by coding a progressive video signal with a 
frame rate of 24 Hz. The progressive video signal is, in turn, 
derived from an interlaced video signal with a field rate of 
60 Hz by eliminating duplicate fields. The coded signal 
includes a control signal indicating each frame of the 
progressive signal from which a duplicate field was 
removed. The apparatus comprises a decoding circuit that 
decodes the coded video signal to provide the progressive 
video signal; and a circuit that extracts the control signal 
from the coded video signal. Finally, the apparatus com- 
prises a circuit that derives three fields of the interlaced 
video signal from certain frames of the progressive video 
signal and derives two fields of the interlaced video signal 
from the rest of the frames of the progressive video signal in 
response to the control signal. 

The deriving circuit preferably derives three fields of the 
interlaced video signal from those frames of the progressive 
video signal indicated by the control signal as being frames 
from which a duplicate field was eliminated. 

The invention finally provides a system for deriving a 
recording signal from an input video signal and for repro- 
ducing the recorded signal to provide an output signal. The 
recording signal has a bit rate that is substantially lower than 
the bit rate of the input video signal and the output video 
signal. The input video signal and the output video signal 
have a field rate of 60 Hz. The input video signal is derived 
from a motion picture film source using 2-3 pulldown. The 
system comprises and encoding apparatus and a decoding 
apparatus. The encoding apparatus includes a detecting 
circuit that detects duplicate fields in the input video signal, 
and a circuit that eliminates each duplicate field from the 
input video signal to provide a progressive video signal 
having a frame rate of 24 Hz. The encoding apparatus also 
includes a coding circuit that codes the progressive video 
signal to provide the recording signal. The decoding appa- 
ratus includes a circuit that decodes the recording signal to 
provide the progressive video signal, and a field deriving 
circuit that derives three fields of the interlaced video signal 
from certain frames of the progressive video signal and 
derives two fields of the interlaced video signal from the rest 
of the frames of the progressive video signal. 

Preferably, the detecting circuit in the encoding apparatus 
includes a circuit that generates a control signal in response 
to each detected duplicated field, and the coding circuit 
additionally includes each control signal in the recording 
signal. The decoding apparatus also preferably additionally 
includes a circuit that extracts the control signal from the 
recording signal, and the field deriving means derives three 
fields of the interlaced video signal from certain frames of 
the progressive video signal and two fields of the interlaced 
video signal from the rest of the frames of the progressive 
video signal in response to the control signal from the 
extracting circuit 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 illustrates the operating principles of the 2-3 
pull-down process. 

FIG. 2 depicts how the efficiency of coding drops when 
applied to frames resulting from fields derived from different 
film source frames using the 2-3 pull-down process. 

FIG. 3 is a block diagram of a coding apparatus and a 
decoding apparatus constituting an image processing appa- 
ratus of a first embodiment of the invention. 

FIG. 4 is a block diagram of the 2-3 pull-down detection 
circuit included in FIG. 3. 
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FIG. 5 shows how duplicate fields handled by the rate 
conversion circuit in FIG. 3. 

FIG. 6 is a block diagram of the encoder 105 in FIG. 3. 

FIG. 7 shows how the motion prediction modes are 
selected in the encoder. 

FIG. 8 is a block diagram of the decoder 113 of the 
decoding apparatus shown in FIG. 3. 

FIG. 9 is a block diagram of a coding apparatus and a 
decoding apparatus constituting an image processing appa- 
ratus of a second embodiment of the invention. 

FIG. 10 illustrates how the various control signals are 
generated in the rate conversion circuit 103 shown in FIG. 
9. 

FIG. 11 is a block diagram of the field order rearranging 
circuit shown in FIG. 9. 

FIG. 12 is a block diagram of the encoder 105 shown in 
FIG. 9. 

FIG. 13 is a graph showing the state of the encoder buffer 
407 in the encoder 105 shown in FIG. 12 and of the decoder 
buffer 701 in the decoder 112 shown in FIG. 16. 

FIG. 14 is a graph showing the state of the encoder buffer 
407 in the encoder 105 shown in FIG. 12 and of the decoder 
buffer 701 in the decoder 112 shown in FIG. 9. 

FIG. 15 is a block diagram illustrating the concept of the 
video buffering verifier. 

FIG. 16 is a block diagram of the decoder 112 and the rate 
conversion circuit 113 shown in FIG. 9. 

FIG. 17 illustrates how the decoding apparatus derives a 30 
video signal with a field rate of 60 Hz from a recorded signal 
with a frame rate of 24 Hz recorded according to the first 
variation of the first recording method. 

FIG. 18 illustrates how the rate conversion circuit derives 
a video signal with a field rate of 60 Hz from a recorded 
signal with a frame rate of 24 Hz recorded according to the 
second variation of the first recording method. 
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A first embodiment of the present invention will first be 
described with reference to FIG. 3, which shows a block 
diagram of the coding apparatus 100 and the decoding 
apparatus 101. 

The coding apparatus 100 will be described first The 
coder input signal VI, an interlaced video signal with a field 
rate of 60 Hz is fed into the 2-3 pull-down detection circuit 
102 which will be described in detail below. Each time the 50 
2-3 pull-down detection circuit 102 detects a duplicated 
field in the coder input signal VI, it generates a field mode 
change signal FMC, which it sends to the rate conversion 
circuit 103. In response to the field mode change signal 
FMC, the rate conversion circuit 103 removes each dupli- 
cated field from the coder input signal VI, and sends the 
resulting video signal to the field order re-arrangement 
circuit 104. The field order re-arrangement circuit 104 
converts the signal from the rate conversion circuit 103 into 
a progressive (non-interlaced) picture signal having a frame- 
rate of 24 Hz. The encoder 106 then compresses and codes 
the picture signal, and feeds the result to the ECC circuit 
106, which adds error correction codes. The modulation 
circuit 107 modulates the signal from the ECC circuit for 
recording on the recording medium 108. 

The decoding apparatus 101 receives the signal repro- 
duced from the recording medium 109. The recording 
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medium 109 is the same as, or is derived from, the recording 
medium 108 on which the signal generated by the coding 
apparatus 100 is recorded. The reproduced signal is demodu- 
lated by the demodulation circuit 110, and fed to the ECC 
decoding circuit 111, where error detection and correction is 
applied. The decoder 112 decodes the signal from the ECC 
decoding circuit into pictures with a frame rate of 24 Hz. The 
rate conversion circuit 113 converts the picture signal with 
a frame rate of 24 Hz into a video signal with a field rate of 
60 Hz. The field order re-arrangement circuit 114 returns the 
field order of the video signal with a 60 Hz field rate from 
the decoder 112 to that of the coder input signal VI, and 
provides the decoder apparatus output signal VO with a field 
rate of 60 Hz. 

Operation of the 2-3 pull-down detection circuit 102 will 
now be described with reference to FIG. 4. The field delay 
circuits 201 and 202 convert the coder input signal VI, a 
video signal with a field rate of 60 Hz, into the delayed 
signal VP1, by a time delay equal to two field periods, i.e., 
V$o second. The difference calculator 203 receives the 
delayed signal VP1 and the coder input signal VI, and 
calculates the difference VP2 between each corresponding 
picture element (pixel) in the two signals. 

The absolute value calculator 204 calculates the absolute 
value VP3 of the difference VP2 calculated for each pixel by 
the difference calculator 203, and feeds the result to the 
accumulator 205, which calculates the sum of the absolute 
value of the difference for each pixel in the field. The 
comparator 206 compares the resulting absolute value dif- 
ference sum with a threshold value TH. When the frame of 
the coder input signal VI is a duplicated field, and can thus 
be removed, the absolute value difference sum VP4 is 
smaller than the threshold value TH, and the comparator 206 
generates the field mode change signal FMC. 

The video signal VII, delayed by one field period relative 
to the video signal VI by the field delay circuit 201, is fed 
to the rate conversion circuit 103, the operation of which is 
illustrated in FIG. 5. When the delayed video signal VU fed 
into the rate conversion circuit 103 (FIG. 1) is an interlaced 
video signal with a field rate of 60 Hz and is derived from 
a motion picture film source using 2-3 pull-down, as 
described above, the field 301 and the field 302 originate 
from the same film source frame. The fields 303 to 305 also 
all originate from one film source frame, different from that 
from which the fields 301 and 302 originate. Since the field 
303 and the field 305 are identical (duplicated fields) as a 
result of the 2-3 pull-down, the field 305 provides excess 
information. 

Accordingly, when the field mode change signal FMC 
from the 2-3 pull-down detection circuit 102 indicates that 
a field, such as the field 305, is a duplicated field, the rate 
conversion circuit 103 treats the field as a duplicated field 
and removes the field from the video signal VII. The rate 
conversion circuit then sends the resulting video signal VI4 
to the field order re-arrangement circuit 104, which rear- 
ranges to order of the fields in the video signal VI4 to that 
required by the coding order of the encoder 105. The field 
order rearrangement circuit 104 may also interleave the two 
fields constituting each frame to provide a progressive 
picture. 

FIG. 6 is a block diagram of the encoder 105. The video 
signal V14 from the field-order rearrangement circuit 104 is 
fed to the blocking circuit 401, which divides the signal VI4 
into macro blocks of, preferably, 16x16 pixels. Each macro 
block is fed to the difference detector 403 via the motion 
detection circuit 402, which will be described below. 
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The difference detector 403 also receives macro blocks of 
motion-compensated pixels from the field memory set with 
motion compensation formed by the field memories 411 to 
414 and the predictor 415, which will also be described 
below. The difference detector determines the pixel-by-pixel 
differences between the macro block of pixels and the macro 
block of motion-compensated pixels. 

The macro blocks of motion prediction errors from the 
difference detector 403 are fed to the DCT circuit 404, which 
orthogonally transforms the motion prediction errors in 
blocks obtained by dividing each macro block in four. The 
DCT circuit 404 preferably applies a discrete cosine trans- 
form (DCT) to each block. The DCT coefficients provided 
by the DCT circuit 404 are fed to the quantizer 405 where 
they are quantized using an adaptively allocated number of 
bits. The quantized DCT coefficients are then fed to the 
variable-length coder 406, where variable-length coding 
such as Huffman coding, or run-length limited coding, is 
applied. The variable-length coder 406 also combines the 
motion vector MV, the prediction mode signal PM, and the 20 
field mode change signal FMC with the quantized DCT 
coefficients. The output of the variable length coding circuit 
406 is fed into the encoder buffer 407, which provides the 
encoder output signal VC1, normally at a constant bit rate. 
It is to be noted that, though omitted from FIG. 6, a signal 
for preventing the encoder buffer 407 from overflowing or 
underfl owing is fed back from the encoder buffer 407 to the 
quantizer 405. 

The quantizer 405 also feeds the quantized DCT coeffi- 
cients to the field memories 411 to 414 with modon com- 
pensation via the dequantizer 408, the inverse DCT circuit 
409, the adder 410, and the selector 417. The dequantizer 
reverses the quantizing performed by the quantizer 405, and 
the inverse DCT circuit 409 reverses the DCT processing 
performed by the DCT circuit 404. The adder 410 reconsti- 
tutes a macro block of the current picture by adding each 
macro block of reconstituted motion prediction errors from 
the inverse DCT circuit 408 to a motion-compensated macro 
block of a reference picture derived from one or more earlier 
pictures stored in the field memories 411 through 414 by the 40 
predictor 415. After the current picture has been completely 
reconstituted, it may then be stored in one of the field 
memories 411 through 414 selected by the selector 417 to 
serve as a reference picture for coding later pictures. 

The macro blocks of pixels from the blocking circuit 401 
are also fed into the motion detection circuit 402, which 
determines a motion vector for each macro block and also 
generates an absolute value difference sum for each macro 
block. The motion detection circuit 402 feeds the absolute 
value difference sum to the motion prediction mode deter- 
mination circuit 418, which determines the motion predic- 
tion mode, as will be described below. The macro blocks of 
pixels also pass from the blocking circuit 401 through the 
motion detection circuit 402 to the difference detection 
circuit 403, which is described above. 

The method by which the prediction mode of each macro 
block is selected will now be described in the case of 
bidirectional predictive coding (B-picture) with reference to 
FIG. 7. Three prediction modes are available: 

(1) Forward prediction from an earlier reference picture; 

(2) Linear prediction from both earlier and later pictures 
(each pixel in the macro block of the current picture is 
calculated by a linear calculation, such as by calculating an 
average value, from a pixel in a reference macro block in an 
earlier picture and a pixel in a reference macro block in a 
later picture; and 
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(3) Backward prediction from a later reference picture. 

If the absolute value difference sum of the prediction error 
between the current picture and an earlier reference picture 
determined by the motion detection circuit 402 is repre- 
sented by X, and the absolute value difference sum of the 
prediction error between the current picture and a later 
reference picture is represented by Y, then, as shown in FIG. 
7: 

when Y>jX, corresponding to the region 601, the motion 
prediction mode determination circuit 418 selects for- 
ward prediction from an earlier field or frame; 
when kX^Y^jX, corresponding to the region 602, the 
motion prediction mode determination circuit 418 
selects linear prediction from both earlier and later 
fields or frames; and 
when Y<kX, corresponding to the region 603, the motion 
prediction mode determination circuit 418 selects back- 
ward prediction from a later field or frame. 
The motion prediction mode determination circuit 413 
supplies the prediction mode PM and the motion vector MV 
to the predictor 415 of the field memory set with a motion 
compensation, and to the read address generation circuit 
1016. The read addresses generated by the address genera- 
tion circuit 1016 in accordance with the prediction mode PM 
and the motion vector MV are supplied to the field memories 
411 to 414. The address generation circuit 1016 generates 
field memory addresses that are offset from the pixel 
addresses of the current macro block by the amount speci- 
fied by the motion vector MV. Macro blocks of pixels are 
read out from the field memories according to addresses 
supplied by the read address generation circuit 1016, and are 
supplied to the predictor 415, which performs selection and 
interpolation in accordance with the prediction mode PM. 
Thus, the field memories 411 to 414 with motion compen- 
sation and the predictor 415 perform motion compensation 
using the prediction mode PM and the motion vector MV. 

The decoder 112 of the decoding apparatus 101 of the first 
embodiment will now be described in detail with reference 
to the block diagram shown in FIG. 8. 

The decoder input signal VD3 to the decoder 112 is 
temporarily stored in the decoder buffer 701. The variable 
length decoder 702 reverses the variable length coding of the 
DCT coefficients received from the decoder buffer, and 
extracts the motion vector MV, the prediction mode PM, and 
the field change mode signal FMC. The dequantizer 703 
dequantizes the quantized DCT coefficients, and the inverse 
DCT circuit 704 transforms the DCT coefficients into blocks 
of motion prediction errors. The dequantizer 703 and the 
inverse DCT circuit 704 are constructed to have character- 
istics that are complementary to those of the quantizer 405 
and the DCT circuit 404, respectively, of the encoder shown 
in FIG. 6. 

Macro blocks of motion prediction errors, formed by 
combining a square arrangement of four adjacent blocks 
from the inverse DCT circuit 704, are fed to one input of the 
adder 705, the other input of which is fed with motion- 
compensated macro blocks derived from one or more ref- 
erence pictures by the predictor 711. The output of the adder 
705, a reconstituted macro block of the current picture, is fed 
into one of the field memories in the field memory set with 
motion compensation consisting of the predictor 711 and the 
field memories 707 to 710. The reconstituted pictures stored 
in the field memories 707 to 710 serve as reference pictures 
for decoding later pictures, and are also fed out from the field 
memories with suitable timing by the selector 706 to form a 
picture of the decoder output signal VO 1. 

The display address generation circuit 713 supplies a 
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display address to the field memories 707 to 710. A frame 
pulse signal from a sync signal generation circuit 712, which 
generates sync signals in response to an external sync signal, 
is supplied to the display address generation circuit 713. 

The field mode change signal FMC extracted by the 
variable length decoder 702, and the decoder output signal 
VOl are fed into the rate conversion circuit 113. When the 
signal FMC indicates that a field was removed from the 
coder input signal, the rate conversion circuit duplicates the 
corresponding field in the decoder output signal to provide 
an output signal with a field rate of 60 Hz. The signal from 
the rate conversion circuit 113 passes to the field order 
rearranging circuit 114 where the field order of the signal 
from the rate conversion circuit is restored to that of the 
coder input signal, and provides the resulting signal as the 
decoding apparatus output signal VO, which has a field rate 
of 60 Hz. 

The second embodiment of the present invention will now 
be described with reference to FIG. 9 which shows a block 
diagram of the coding apparatus 100 and of the decoding 
apparatus 101. In FIG. 9, elements corresponding to those 
shown in FIG. 3 are indicated by same reference characters. 

The coding apparatus 100 will be described first The 
coder input signal VI, a video signal with a field rate of 60 
Hz, is fed into the 2-3 pull-down detection circuit 102, 
where the field mode change signal FMC is generated each 
time a duplicate field is detected. In response to the field 
mode change signal, the rate conversion circuit 103 removes 
each duplicate field from the coder input signal VI, and 
sends the resulting video signal to the field order re-arrange- 
ment circuit 104. 

The field order re-airangement circuit 104 changes the 
order of the fields after rate conversion to that required by 
the encoder 105. The encoder 105 compresses and codes the 
picture signal after field-order rearrangement, and feeds the 
resulting coded signal to the ECC circuit 106, which adds 
error correction codes. Hie modulation circuit 107 modu- 
lates the signal from the ECC circuit 106 for recording on 
the recording medium 108. In addition, in the second 
embodiment, control signals, such as DSO or DFN, which 40 
will be described below, indicating the method by which the 
frame is to be displayed, are included in the signal recorded 
on the recording medium 108. 

The decoding apparatus 101 will now be described. The 
signal recorded on the recording medium 109, which is 45 
derived from the recording medium 108, is reproduced, 
demodulated by the demodulation circuit 110, and fed into 
the ECC decoding circuit 111, where error detection and 
correction are applied. The decoder 112 decodes the signal 
from the ECC circuit into a video signal having a frame rate 50 
of 24 Hz. 

The rate conversion circuit 113 generates addressing 
information for feeding to the decoder 112 to return the 
picture order of the video signal generated by the decoder 
112 to that of the coder input signal VI, and to convert the 
rearranged signal into a video signal having field rate of 60 
Hz. The decoder provides the resulting signal as the decod- 
ing apparatus output signal VO with a field rate of 60 Hz. 

The operation and construction of the 2-3 pull-down 
detection circuit 102 of the present embodiment are similar 60 
to those of the first embodiment described above, and so will 
not be described again here. 

While also the operation of the rate conversion circuit 103 
is similar to that described above with reference to FIG. 5, 
the signals generated by the rate conversion circuit 103 in 65 
the second embodiment will be described with reference to 
FIG. 10. 
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The rate conversion circuit 103 of the second embodiment 
receives the field mode change signal FMC, described 
above, from the 2-3 puU-down detection circuit 102. When 
the rate conversion circuit 103 detects that the FMC signal 
is in its 1 state, it does not feed the corresponding duplicate 
field from the coder input signal to the field order re- 
arrangement circuit 104. On the other hand, when the rate 
conversion circuit detects that the FMC signal is in its 0 
state, it feeds fields of the coder input signal unchanged to 
the field order re-arrangement circuit 104. 

In addition, the rate conversion circuit 103 of the second 
embodiment generates a top_field_first flag DSO, which 
indicates the order in which the fields of the frame are to be 
displayed. The DSO flag is a 1-bit flag that can only have the 
values of 0 or 1 . In its 1 state, the flag DSO indicates that the 
first field of the video signal of the frame to which the flag 
pertains is to be displayed first and the second field of the 
video signal is to be displayed second. On the other hand, in 
its 0 state, the flag DSO indicates that the second field of the 
video signal of the frame to which the flag pertains is to be 
displayed first and the first field is to be displayed second. 
Conventionally, the first-displayed field is an odd field. 

The rate conversion circuit 103 also generates a number_ 
of_field_displayed_code flag DFN, which indicates 
whether the frame to which the flag pertains is to be 
displayed as two fields or as three fields. Again, the DFN flag 
is a 1-bit flag that can only have the values of 0 or 1 . In its 
1 state, the flag DFN indicates that the frame to which the 
flag pertains is to be displayed as three fields. On the other 
band, in its 0 state, the flag DFN indicates that the frame to 
which the flag pertains is to be displayed as two fields. 

It can be seen in FIG. 10 that the 2-3 pull-down detection 
circuit 102 (FIG. 9) generates the field mode change signal 
when it detects the duplicate fields 4 and 9. The field 0 is a 
top field, so, in the output frame (a), corresponding to the 
film source frame A, the top_field_first flag DSO is in its 
1 state, indicating that the first field of the frame is to be 
displayed first Also, the output frame (a) is derived from 
only two fields of the coder input signal VI, so the number^ 
of_field_displayed_code flag DFN is set to its 0 state. 

The first field (field 2) of the output frame (bX corre- 
sponding to the film source frame B, is a top field, so the 
top__field__first flag DSO is set to its 1 state, indicating that 
the first field (field 2} of the frame is to be displayed before 
the second field (field 3). The output frame (b) is derived 
from three fields (fields 2, 3, and 4) of the coder input signal 
VI, so the number__oL.field_displayed_code flag DFN is 
set to its 1 state to indicate that the output frame (b) must be 
displayed as three fields. 

The first field (field 5> of the output frame (c), corre- 
sponding to the film source frame C, is a bottom field, so the 
top_field_first flag DSO is set to its 0 state, indicating that 
the second field (field 6, a top field) of the output frame (c) 
is to be displayed after the first field (field 5). The output 
frame (c) is derived from only two fields of the coder input 
signal VI, so the number_of_field_displayed_code flag 
DFN is set to its 0 state to indicate that the output frame (c) 
must be displayed as two fields. 

Finally, the first field (field 7) of the output frame (d), 
corresponding to the film source frame D, is a bottom field, 
so the top_field_first flag DSO is set to its 0 state, indi- 
cating that the second field (field 8, a top field) of the output 
frame (d) is to be displayed after the first field (field 7). The 
output frame (d) is derived from three fields (fields 5, 6, and 
7) of the coder input signal VI, so the number_of_field__ 
displayed_code flag DFN is set to its 1 state to indicate that 
the output frame (d) must be displayed as three fields. 
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The rate conversion circuit 103 feeds the flags DSO and 
DFN to the encoder 105, and to the field order re-arrange- 
ment circuit 104. 

Construction of the field order re-arrangement circuit 104 
is shown in FIG. 11. The field order re-arrangement circuit 5 
104 consists of a set of plural field memories 161 and the 
address controller 162. 

The picture signal from the rate conversion circuit 103 is 
fed into the field order-rearrangement circuit 104, and is first 
recorded in the field memory set 161 at an address desig- 10 
nated by the address controller 162. Then, the picture signal 
at the address designated by the address controller 162 is 
read out from the field memory set 161 and is fed to the 
encoder 105. 

The address controller 162 generates addresses in 15 
response to the picture coding type signal PCT, the macro 
block address ABL, and the top_fieId_first flag DSO. The 
picture coding type signal PCT is generated by the picture 
coding type generator 420 in the encoder 105. The macro 
block address ABL is generated by the blocking circuit 401, 20 
also in the encoder 105. The top_field__first flag DSO is 
generated by the rate converter 103, 

The field memory set 161 stores several fields. The 
address controller 162 refers to the signals PCT, ABL and 
DSO, generates an address where a picture signal received 25 
from the rate converter 103 will be written in the field 
memory set 161, and feeds the address to the memory set 
161. The picture signal received from the field order re- 
arrangement circuit 104 is then written into the memory set 
161 in accordance with the address. 30 

Also, the address controller 162 refers to the signals PCT, 
ABL, and DSO, generates an address in the field memory set 
161 where the macro block of the present picture signal to 
be fed to the encoder 105 is recorded, and feeds the address 
to the memory set 161. The macro block of the present 35 
picture signal read out of the field memory set 161 in 
accordance with the address is fed into to the encoder 105. 
By changing the order of the read out addresses relative to 
the recording addresses, the fields received from the rate 
conversion circuit 103 can be rearranged to provide the field 40 
order required by the encoder 105. Moreover, by reading 
alternate lines from consecutive fields, the field order rear- 
rangement circuit can convert two interlaced fields into a 
single non-interlaced frame for frame mode coding. 

FIG. 12 shows a block diagram of the encoder 105 of the 45 
second embodiment in which components corresponding to 
those in the encoder described above with reference to FIG. 
6 are indicated by same reference characters. 

In the encoder 105, the blocking circuit 401 generates the 
address ABL of each macro block of, preferably, 16x16 50 
pixels, in the frame, and feeds the address to the field order 
re-arrangement circuit 104. The field order re-arrangement 
circuit 104 reads out from the field memory set 161 the 
macro block of pixels indicated by each macro block address 
ABL, and feeds the macro block of pixels as the input signal 55 
VI4 into the encoder 105. The signal VI4 passes through the 
blocking circuit 401, and the motion detection circuit 402 
into one input of the difference detector 403. 

The difference detector 403 also receives a motion-com- 
pensated macro block of pixels corresponding to each macro 60 
block of pixels in the input signal V14. The motion-com- 
pensated macro blocks of pixels are supplied by the field 
memory set with motion compensation formed by the field 
memories 411 to 414 and the predictor 415 described above 
with reference to FIG. 6. The difference detector 403 deter- 65 
mines the pixel-by-pixel differences between each macro 
block of pixels in the input signal VI4 and the corresponding 
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motion-compensated macro block of pixels received from 
the predictor 415. 

The macro blocks of motion prediction errors from the 
difference detector 403 are fed to the DCT circuit 404, which 
orthogonally transforms blocks of motion prediction errors 
obtained by dividing each macro block by four. The DCT 
circuit 404 preferably applies a discrete cosine transform 
(DCT) to each block. The DCT coefficients provided by the 
DCT circuit 404 are fed to the quantizer 405 where they are 
quantized using an adaptively-allocated number of bits. The 
quantized DCT coefficients are then fed to the variable- 
length coder 406, where variable-length coding such as 
Huffman coding, or run-length limited coding, is applied. 
The output of the variable-length coder 406 is fed into the 
encoder buffer 407, which provides the compressed output 
signal VC1, normally at a constant bit rate. The buffer 
supervisory circuit 1017, which will be described below, 
prevents overflow or underflow of the encoder buffer 407 by 
feeding the signal OVF back to the quantizer 405 to control 
the number of bits generated by the quantizer 405. 

The quantizer 405 also feeds the quantized DCT coeffi- 
cients to the field memories 411 to 414 with motion com- 
pensation via the dequantizer 408, the inverse DCT circuit 
409, the adder 410, and the selector 417. The dequantizer 
reverses the quantizing performed by the quantizer 405, and 
the inverse DCT circuit 409 reverses the DCT processing 
performed by the DCT circuit 404. The adder 410 reconsti- 
tutes a macro block of the current picture by adding each 
macro block of reconstituted motion prediction errors from 
the inverse DCT circuit 408 to a motion-compensated macro 
block of a reference picture derived from one or more earlier 
pictures stored in the field memories 411 through 414 by the 
predictor 415. After the current picture has been completely 
reconstituted, it may then be stored in one of the field 
memories 411 through 414 selected by the selector 417 to 
serve as a reference picture for coding later pictures. 

The macro blocks of the input signal VI4 are also fed to 
the motion detection circuit 402 which determines a motion 
vector for each macro block, and also generates an absolute 
value difference sum for each macro block. The motion 
detection circuit 402 feeds the absolute value difference sum 
to the motion prediction mode determination circuit 418. 

The three available prediction modes are selected as 
described above with reference to FIG. 6. 

In the second embodiment, the motion prediction mode 
determination circuit 418 supplies the prediction mode PM 
and the motion vectoT MV to the predictor 415 of the field 
memory set with a motion compensation, and to the read 
address generation circuit 1016. The read addresses gener- 
ated by the address generation circuit 1016 in accordance 
with the prediction mode PM and the motion vector M V arc 
supplied to* the field memories 411 to 414. The address 
generation circuit 1016 generates field memory addresses 
that are offset from the pixel addresses of the current macro 
block by the amount specified by the motion vector MV. 
Macro blocks of pixels are read out from the field memories 
according to addresses supplied by the read address genera- 
tion circuit 1016, and are supplied to the predictor 415, 
which performs selection and interpolation in accordance 
with the prediction mode PM. Thus, the field memories 411 
to 414 with motion compensation and the predictor 415 
perform motion compensation using the prediction mode 
PM and the motion vector MV. 

In the second embodiment of the encoder 105 shown in 
FIG. 12, the picture coding type generation circuit 420 
determines whether each frame should be coded using 
intra-frame coding (I-picture), forward prediction coding 
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(P-picture), or bidirectional prediction coding (B-picture). 
The picture_coding_type signal PCT T generated by the 
picture coding type generation circuit 420, indicates the 
picture coding type for each frame. The number of pictures 
between successive I-pictures, between successive P-pic- 5 
hires, and between an I-ptcture and the first following 
P-picture may be set to predetermined values. For example, 
an 1-picture may be provided every 15 frames, and a 
P-picture every 3 frames. The two frames between succes- 
sive P-pictures, or between an I-picture and the first follow- 10 
ing P-picture are B-pictures. Alternatively, the number of 
pictures between successive I-pictures, between successive 
P-pictures, and between an I-picture and the first following 
P-picture may be signal-dependent. 

The picture coding type generation circuit 420 feeds the 15 
picture_coding_type to the motion prediction mode deter- 
mination circuit 418, the blocking circuit 401, the variable 
length coder 406 and the temporal reference generation 
circuit 421. The temporal reference generation circuit gen- 
erates the temporal__reference signal for feeding into the 20 
variable-length coder 406. The temporal^ reference signal is 
a signal associated with each input picture and indicates the 
order in which the pictures in a Group of Pictures (GOP) are 
to be displayed, as will be described in detail below. The 
temporal_reference is fed from the temporal reference 25 
generation circuit 421 to the variable length coder 406. 

The variable-length coder 406 of the second embodiment 
will now be described. The variable-length coder 406 adds 
a header to the coded video signal for each picture to prepare 
a signal for recording on the recording medium 108. When 30 
the signal recorded in the recording medium has a frame rate 
of 24 Hz and is derived from a motion picture film source via 
a video signal with a 60 Hz field rate obtained using 2-3 
pull-down, as described above, the signal can be recorded on 
the recording medium 108 using either of the following two 35 
recording methods. 



In the first method, one or more control signals are 
recorded as part of the signal recorded on the recording 
medium to indicate which field of which frame should be 
repeated when the recording is reproduced to provide an 
output video signal with a field rate of 60 Hz. In the second 
method, no such control signal is recorded, and, when the 
recording is reproduced, the decoder performs an automatic 
2-3 pull-down process to provide an output video signal 
with a field rate of 60 Hz. 

Two variations on the first recording method, in which a 
flag or control signal indicates which field should be 
repeated, will first be described. 
Fust Recording Method—Variation 1 

The 2-3 pull-down detection circuit 102 sets the state of 
the field mode change signal FMC to 1 each time it detects 
a duplicate field in the input video signal. Accordingly, in the 
first variation on the first recording method, the FMC signal 
is used as the control signal to indicate the frames in the 
recording from which three fields should be generated when 
the recording is reproduced. In the first variation of the first 
recording method, the FMC signal is added to, and is 
recorded together with, the picture header of those frames 
from which three fields should be generated. The FMC 
signal could be recorded in the picture coding extension of 
the picture header (these terms will be described in more 
detail below). 

Before the second variation on the first recording method 
and the second recording method are described, a descrip- 
tion of some of the header syntax and the buffering arrange- 
ments defined in the MPEG-2 standard for the coding 
apparatus and the decoding apparatus will be described. 

The syntax of an MPEG-2 video sequence is shown in 
Table 1. The mathematical operators and syntax of Table 1 
are similar to those used in the C programming language. 
The terms used in Table 1 are defined in the working draft 
of ISO/DEC Recommendation H.26x for Generic Coding of 



TABLE 1 



video_sequenccO { 



No. of bits Mnemonic 



nexL_start_codeO 
sequencc_headerO 

if (nextbitsO = extenaon_start_code { 

sequence extcnsionO 

do{ 



do{ 

if (n«t_bitsO => group_start_code) { 
group_of__picturts_headeK) 
txtcnsion_and_uscr_dalfl( 1 ) 

). 

pictaie_JieadexO 

exicmions_and_user daia(2) 

pktuitL_dmaO 
}whifc ( (next_bitsO = picture_stait_code) || 
ncxOritsO = gp>up_8tart_code) ) 
if (nextbitsO!= sequence_cnd_code) { 

sccjncncc cjitcnsiooO 

> 

} while (nextbit$0!= scquence_end_code) 
>else{ 
do{ 

do{ 

group of pictures htad£rsO 

if (next_bitsO = use_daia_start_code) 
user_dataO 

do { 

pictnrt_headerO 

if (next_bitsO = uscr_daia_start_code) 
user_dataO 
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TABLE 1 -continued 



video_sequenceO { No. of bits Mnemonic 



picture_dataO 
> while (nexl_bitsO = pieture_stan_code) 
Jwhile (next_bilsO = group_shirt_.code) 
if (nextbitsO N= sequcnce_end_codc) 
sequcncc_headerO 
Jwhilc (nextbitsO != sequence_end_code) 

> 

sequence_end_code 



TABLE 2 



TABLE 4-continued 





No. or 




sequence_headers { 


bits 


Mnemonic 


sequencc_headcr_code 


32 


bslbf 


horizonla]_size_vahie 


12 


uimsbf 


vcmcnl_si2c_value 


12 


nimsbf 


pel_aspect_ratio 


4 


uimsbf 
uimsbf 


frame. . rate 


4 


bii_raie 


18 


uimsbf 


marker bil 


1 


-r 


vbv_buffer_size 


10 


uimsbf 


con strained_paramctcr_ flag 


1 




load_Jnlra_quantizer_malrix 


1 




if (load_intra_quantizer_matrix 






intra_quanrizcr_mairix| 64J 


8*64 


uimsbf 


load_non_intra_quantizer_matrix 


1 




if Ooad_non_Jnlra_quanlizcr_matrix 






non _incra_quanti2er_malrix| 64 J 


8*64 


uimsbf 


ncxl_st2rt_codcO 






TABLE 3 


sequence_extensionO { No. 


of bits 


Mnemonic 



} 



extension_slarl_code 32 

extension_starL_codc_identificr 4 

pro file_and _le vel_i ndicati on 8 

non_inierfaced_ sequence 1 

cbromn_formai 2 

horizonial_size_ex tension 2 

vertical_size_ex tension 2 

bil_raie_cxtension 12 

marker 1 

vbv_buffer_sizc_cx tension 5 

frame_raie__exiension 8 
next_stnrt_codeO 



bslbf 

uimsbf 

uimsbf 

uimsbf 

uimsbf 

uimsbf 

uimsbf 

uimsbf 

uimsbf 



20 



25 



30 



35 



40 



45 



50 



Moving Pictures and Associated Audio, which is incorpo- 
rated herein by reference. Table 2 shows the syntax of the 
MPEG-2 sequence header referred to in Table 1, and Table 
3 shows the syntax of the MPEG-2 sequence extension 
referred to in Table 1. 

The frame_rate field of the sequence header shown in 55 
Table 2 is 4 bits long and defines the frame rate of the video 
signal in the video sequence. The possible states of the frame 
rate field are shown in Table 4. 



TABLE 4 



frame rale 



frames per second 



0000 
0001 
0010 
0011 
0100 



forbidden 
23.976 
24 
25 

29.97 



60 



65 



frame rale 


frames per second 


0101 


30 


0110 


50 


0111 


59:94 


1000 


60 




reserved 


1111 


reserved 



Included in the sequence extension shown in Table 3 is the 
non_interlaced_sequence flag, the state of which indicates 
whether the video signal in the video sequence is interlaced 
or progressive (i.e., non-interlaced). The non_interlaced_ 
sequence flag is set to its 1 state when the video sequence 
contains only progressive pictures, otherwise, the non_ 
interlaced__sequence flag is set to its 0 state. When the 
non_jnterlaced_sequence is in its 0 state, the frame_rale 
represents the number of frames per second of the intended 
display sequence. When the non_Jnterlaced_sequence is in 
its 1 state, the frame_rate specifies the number of non- 
interlaced frames per second, and, consequently, the number 
of progressive pictures per second. 

The sequence header shown in Table 2 and the sequence 
extension shown in Table 3 also include the vbv__buffer__ 
size field and the vbv_buffer_size_extension field, respec- 
tively. The contents of the vbv_buffer_size field and the 
vbv__buffer_size_extension field together provide data 
from which the size B of the VBV buffer can be calculated, 
as will be described below. The Video Buffering Verifier 
(VBV) is a hypothetical decoder which is conceptually 
connected to the output of the coding apparatus. The VBV 
includes a hypothetical buffer having a size defined by the 
VBV buffer size. The output signal of the encoder is fed into 
the VBV buffer at the constant bit rate being used. Signal is 
removed from the VBV buffer according to rules that will be 
set forth in detail below. It is a requirement of an MPEG 
coding apparatus that the bit stream that it produces shall not 
cause the VBV buffer to either overflow or underflow. Thus, 
the VBV buffer size B defines the minimum buffer size 
required to decode the output signal generated by the coding 
apparatus. More information on the VBV is set forth in 
Annex C of the working draft of ISO/IEC Recommendation 
H.26x. 

The ten least significant bits of the vbv_buffer_size arc 
located in the vbv_buffer_size field of the sequence header 
shown in Table 2. The five most significant bits of the 
vbv_buffcr__size are located in the vbv__buffer_sizc_ex- 
tension field in the sequence extension shown in Tabic 3. 
The five bits from the vbv_buffer__size extension field arc 
combined with the ten bits from the vbv_buffer_size field 
to generate a 15-bit integer called vbv_bufler_sizc. The 
size B of the VBV buffer is then calculated from the 
vbv_buffer_sizc as follows: 
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fl=l 6x1 ,024xvbv_buffer__size 



In the video sequence defined above in Table 1, a picture 
header and a picture coding extension, each including sev- 
eral fields, precede the video signal of each picture. The 
syntax of the picture header and of the picture coding 
extension is shown Table 5 and Table 6, respectively. 



18 



TABLE 6-continued 



picture_coding__exteosionO{ 



No of bits Mnemonic 



sob_caiiier 
burst_anjphtode 



uimsbf 



TABLE 5 



picture_htadcrO } 



No. of bits Mnemonic 



picture_start_code 
temporal_ic f erence 
picture_coding_type 
vbv_delay 

if (pictnre_coding_type = 20 pictorc_coding_type = 3) { 
fnlL_pel_forward_vector 
forwaroLJL_code 

> 

if (picmre_coding_type = 3 { 
mU_peI_backward_vector 
backward __f_code 

} 

while (nextbitsO = { 

cxtnL_bit picture 

extra__tnfonnation l picture 

} 

extra_bit picture 

next start codeO 



32 
10 
3 
16 



bslbf 
uimsbf 
nimsbf 
uimsbf 



uimsbf 

uimsbf 
T 



A number of the fields in the picture header shown in 
Table 5 will now be described. 

The temporal_reference field is a 10-bit field, the con- 
tents of which indicate the display order of the picture to 
which the picture header belongs (the order of the pictures 35 
in the video sequence is not the same as the order in which 
the pictures will be displayed). A picture counter is incre- 
mented by one for each input picture to provide the tempo- 
ral_reference. The temporal_reference counter is reset to 
zero for the first picture of each group of pictures, or if it 40 
reaches 1024. When a frame is coded as two fields, the 
temporal_reference is the same for both fields. The pic- 
ture__coding_type code is a 3-bit field, the contents of 
which identify how the picture to which the picture header 
belongs has been coded, 45 



TABLE 6 



picture_coding_extensionO{ 


No of bits Mnemonic 


extensi on_start_code 


32 bstbf 


extension_fd 


4 mmsbf 


forward horizoixtal_f_code 


4 uimsbf 


forward^verti cal_f__code 


4 uimsbf 


backward __JiorizonLal__(_codc 


4 uimsbf 


backward_YCrtical_f_.codc 


4 uimsbf 


inlra^_dc precision 


2 nimsbf 


piclure_structurc 


2 uimsbf 


top_6rid__fim 


1 uimsbf 


frame_pred frame dct 


1 uimsbf 


concealment motion vectors 


1 uimsbf 


q # _scale_type 


1 uimsbf 


intra_vlc_jFcrmal 


] uimsbf 


oltcmifltc, 5£ft^ 


1 uimsbf 


nnmber_of_field_di splayed_code 


1 uimsbf 


chrorna_postprocessing type 


1 uimsbf 


non_inierlaced__frame 


1 uimsbf 


composite_dj$play_flag 


1 uimsbf 


if (composite_display_flag { 




v-axis 


1 uimsbf 


field__sequence 


3 uimsbf 



50 



TABLE 6-continued 



picture coding extensionO{ 


No of bits Mnemonic 


sub_cairier_phase 

> 

nexl_start_codeO 

} 


8 uimsbf 





i.e», whether the picture has been coded using intra-picture 
coding (1-picture), prediction coding (P-picture), or bidirec- 
tional prediction coding (B-pictureX or whether only the DC 
components resulting from intra-picture coding have been 
coded (D-picture). The possible states of the picture_cod- 
ing— type field are shown in Table 7. No D-picture may in a 
video sequence together with a picture of any other type. 



TABLE 7 


picture coding type 


coding method 


000 


forbidden 


001 


intra- coded (I) 


010 


predictive-coded (P) 


011 


biritfectionaJfy- predictive-coded (B) 


100 


dc intrn-coded (D) 


101 


reserved 


110 


reserved 


111 


reserved 



The vbv__delay field is a 16-bit field, the contents of 
which are used when the encoder provides an output signal 
with a constant bit rate. The vbv_delay defines the initial 
occupancy of the decoder buffer at the start of decoding to 
prevent the decoder buffer from underflowing or overflow- 
ing. The vbv_delay is defined in terms of the time required 
to fill the VBV buffer at the target bit rate R from an initially 
empty state to the desired initial occupancy before the video 
signal of the current picture is removed from the buffer. The 
vbv_delay is the number of cycles of the 90 kHz system 
clock that the VBV should wait after receiving the final byte 



65 
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of the picture_starL_code in the picture header. The vbv_ 
delay may be calculated from the state of the VBV buffer as 
follows: 

vbv_dclay w =9O,000xB pl */R 

In the above equation: 

n>0, 

B„* is the VBV buffer occupancy immediately before 
removing the video signal of the picture n from the 
buffer but after removing any COP layer and sequence 
header preceding the picture n, and 

R is the bit rate indicated by bit__rate in the sequence 
header. 

A number of the fields of the picture coding extension 
shown in Table 6 will now be described. 

The picture_structure field is a 2-bit field, the contents of 
which indicate whether the picture is a frame picture, or, 
otherwise, whether the picture is the top field or the bottom 
field of a frame consisting of two fields. The possible states 
of the picture_structure field are shown in Table 8. 

TABLE 8 



picture structure 


Meaning 


11 


Frame-Picture 


01 


Top Held 


10 


Bottom Fieid 


00 


reserved 



10 



15 



20 



25 



30 



35 



40 



45 



The significance of the state of the top_field_first flag 
depends upon the picture structure indicated in the picture_ 
structure field. When the rrame_structure indicates that the 
picture is a frame picture, the top__field__first .flag in its 1 
state indicates that the top field of the frame is to be 
displayed first On the other hand, the top_field_first in its 
0 state indicates that the bottom field of the frame is to be 
displayed first. In a field structure picture, or in a progressive 
frame-structure picture in which the non_inter)aced_se- 
quence flag is set to its 1 state, the top__field_Jirst flag is 
always set to its 0 state. 

The number_of_field_displayed_code flag that indi- 
cates the number of fields in which the picture is to be 
displayed. When the flag is set to its 1 state, the picture is to 
be displayed as three fields. When the flag is set to its 0 state, 
the picture is to be displayed as two fields. If the picture is 
a progressive picture for which the picture_structure code is 
1 1 and the non_interlaced_sequence flag is in its 1 state, the 
number_of_field_displayed_code flag must be set to its 0 50 
state. A frame consisting of field pictures is always displayed 
in two fields. 

Control of the encoder buffer by the buffer supervisory 
circuit 1017 will now be described with reference to FIGS. 
13, 14 and 15. 55 

First, referring to FIG. 15, the buffer supervisory circuit 
1017 of the second embodiment controls bit allocation in the 
variable-length coder 406 to prevent the decoder buffer 804 
(corresponding to the buffer 701 in the decoder shown in 
FIG. 16) from overflowing or underflowing when the output 60 
signal generated by the coding apparatus is decoded. The 
buffer supervisory circuit operates by hypothetically con- 
necting the above-mentioned hypothetical video buffering 
verifier (VBV) buffer 811 to the output of the coding 
apparatus. The output signal generated by the coding appa- 
ratus is fed into the hypothetical VBV buffer 811. The video 
signal of each picture stored in the hypothetical VBV buffer 
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is read out of the VBV buffer in accordance with the rules 
set forth below, and in response to the contents of the 
vbv_delay field. The buffer supervisory circuit 1017 moni- 
tors the state of the hypothetical buffer 811 and controls bit 
allocation in the variable-length coder to prevent the hypo- 
thetical VBV buffer from overflowing or underflowing. 

The buffer supervisory circuit controls the variable-length 
coder video bit stream so that the output signal of the coding 
apparatus satisfies the following video buffering verifier 
requirements: 

(1) The VBV and the coding apparatus have the same 
clock frequency and the same picture rate, and are 
operated synchronously. 

(2) The VBV has a VBV buffer of size B, where B is 
calculated as described above from the vbv_buffer_ 
size in the sequence header and the vbv_buffer__sizc_ 
extension in the sequence header extension. 

(3) The VBV is initially empty, and is filled with the 
output signal from the coding apparatus for the time 
specified by the vbv_delay in the picture header. 

(4) All of the video signal for the picture that has been in 
the VBV buffer the longest is removed instantaneously. 
Then, after a time t calculated from the picture_rate in 
the sequence header, the picture_structure in the pic- 
ture coding extension, and the number_of_field_ 
displayed__code in the picture header of the last picture 
decoded, all of the video signal for the picture which, 
at that time, has been stored in the buffer longest is 
instantaneously removed. The period of time t is 
defined as follows: 

*=6dd_coant/(field_pCT_picturcxF) 

Where: 

field_per__picture=2 when the picture__structure=l 1 , i.e., 

in the case of a frame structure, or 
field__per_picture=l when the picture_structure has a 

value different from 1 1 ; 

P=the number of pictures per second calculated from the 
picture_rate; and 

field_count is the number of displayed fields calculated 
from the number_of__fields__displayed__code flag in 
the picture header of the last picture displayed. 

The sequence header and the GOP header immediately 
preceding a picture are removed simultaneously with the 
picture. The VBV is checked.immediately before any data or 
signal are removed. Each time the VBV buffer is checked, its 
occupancy must He between 0 and B bits, where B is the 
VBV buffer size in bits calculated from the vbv__buffer_ 
size and the vbv_buffer_size extension as described above. 

The second variation of the first recording method and the 
second recording method will now be described. 
First Recording Method — Second Variation 

The MPEG-2 syntax makes no official allocation of a field 
in the picture header for storing the FMC signal required by 
the first variation on the first recording method. Thus, the 
second variation on the first recording method uses control 
signals and flags that conform to the official MPEG-2 syntax 
to indicate which fields of which frames of the recorded 
signal should be duplicated when the recording is repro- 
duced as an interlaced video signal with a 60 Hz field rale. 
In the MPEG-2 syntax, the non_intcrlaced_sequencc flag, 
indicating whether or not that all the pictures in the video 
sequence are non-interlaced pictures, and the framc_ralc 
field, the contents of which indicate the picture rale, arc 
fields in the sequence header that begins each video 
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sequence. In the second variation of the first recording 
method, the variable length coder 406 sets the frame_rate to 
24 Hz or 23.976 Hz, and sets the non_interlaced_sequence 
to 0. 

The second variation on the first recording method uses 
the Hags DSO (top_field_first) and DFN (number_of_ 
field_displayed_code) provided by the rate conversion 
circuit 103 as flags to indicate which field should be repeated 
in the decoding apparatus output signal VO. The rate con- 
version circuit 103 feeds the flags DSO and DFN to the 
variable-rate coder 406 where they are entered into the fields 
allocated by the MPEG-2 standard in each picture header in 
the video sequence. In the picture header, the flag DSO in its 
1 state indicates that the first field of the picture is to be 
displayed first, whereas the flag DSO in its 0 state indicates 
that the second field of the picture is to be displayed first. 
Additionally, the flag DFN in its O state indicates that the 
picture is to be displayed as two fields, whereas the flag DFN 
in its 1 state indicates that the picture is to be displayed as 
three fields. 

Second Recording Method 

The second recording method, which provides a signal 
that the decoding apparatus decodes by automatically per- 
forming 2-3 pull-down, will now be described. 

The second recording method sets the non_interlaced_ 
sequence flag to its 1 state, and the frame_rate to 24 Hz or 
23.976 Hz. Because of the state of the non_interlaced_ 
sequence flag, the top__field_first flag is always set to 0. 
Additionally, the number__of_field_displayed_code flag is 
set to 0. No signals are included in the encoder output signal 
to indicate which fields are to be duplicated in the decoder. 
When the rate conversion circuit in the decoder recognizes 
this combination of the non_interlaced_sequence flag and 
the frame_rate, it automatically performs 2-3 pull-down, as 
will be described below. 

The effect of the second recording method, in which the 
decoding apparatus automatically performs 2-3 pull-down, 
on the bit rate of the output signal from the encoder will now 
be described. 

The second recording method does not control the state of 
the number_of_field_displayed__code flag in the picture 
header, and the decoding apparatus 101 automatically per- 
forms 2-3 pull-down to provide a decoding apparatus output 
signal with a 60 Hz field rate for display. As a result of this, 
as shown in FIG. 13, since the output signal of the encoder 
includes a different number of pictures per second from the 
number of pictures per second in the output signal of the 
decoder, the requirements for the VBV buffer set forth above 
are not satisfied. Hence, if the buffer supervisory circuit 
1017 controls the quantizer 405 in the coding apparatus 100. 
based on the assumption that the coding apparatus is feeding 
the VBV buffer, an overflow or an underflow may possibly 
occur in the actual buffer in the decoding apparatus 101. 
Accordingly, when the second recording method is used, 
countermeasures to prevent possible overflow or underflow 
of the buffer in the decoding apparatus must be taken in the 
encoder 105. 

The requirements for using the second recording method 
are: 

(1) The VBV buffer size B must be calculated using a 
vbv__buffer_size obtained by multiplying the vbv_ 
buffer_size in the sequence header and the sequence 
extension by Vs ( 4 /s corresponds to the ratio of the frame 
rate between the coding apparatus and the decoding 
apparatus). $5 

(2) A vbv_delay must be chosen by considering both the 
case in which the video signal of the first frame of a 
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video sequence is displayed as three fields and the case 
in which the video signal of the first frame is displayed 
as two fields. 

FIGS. 13 and 14 will now be described. The distance 
between the solid sloped parallel lines in each figure repre- 
sents the buffer size. The inclination of the parallel lines in 
each figure represents the bit rate of the output signal of the 
coding apparatus or the input signal of the decoding appa- 
ratus. The solid stepped line in each figure shows how the 
encoder 801 transfers the video signal of each picture into 
the encoder buffer 802. As described above, all of the video 
signal for each picture is deposited instantaneously into the 
encoder buffer at each picture period of 24 Hz. The broken 
stepped lines in each figure show how the decoder 805 
withdraws the video signal for each frame from the decoder 
buffer 804. As described above, after the delay time defined 
by the vbv__delay, all of the video signal for each picture is 
withdrawn instantaneously from the decoder buffer at each 
picture period of 30 Hz. The buffer supervisory circuit 1017 
ensures that each solid stepped line is maintained within the 
associated parallel lines. 

When the second recording method is used, the distance 
between the broken lines in FIG. 14 represents a buffer 
capacity B' (calculated from vbv_buffer__size xVs). In this 
instance, buffer control is performed so that the centers of 
areas bounded by the sloped parallel broken lines and the 
sloped parallel solid lines coincide with each other. 

In this manner, by causing the buffer supervisory circuit 
1017 to reduce the size of the VBV buffer compared with the 
capacity of the actual decoder buffer, the second recording 
method can be used without the risk of overflow or under- 
flow in the decoder buffer. However, the reduction in the size 
of the VBV buffer may result in fewer bits being allocated 
to some pictures than if bits were allocated based on the 
VBV buffer having its full size. This may result in some 
impairment of the picture quality. 

The decoder 112 of the second embodiment will now be 
described with reference to the block diagram shown in FIG. 
16. Components in FIG. 16 that correspond to components 
in the decoder shown in FIG. 8 described above are indi- 
cated by the same reference characters. The input signal 
VD3 from the ECC decoder circuit 111 is temporarily stored 
in the decoder buffer 701, as described above. From the 
decoder buffer 701, the input signal passes through the 
variable length decoder 702, where at least one control 
signal is extracted from the various headers in the input 
signal VD3, as will be described below. The variable length 
decoder also reverses the variable length coding of the DCT 
coefficients carried out in the variable-length coder 406 in 
the coding apparatus. 

Then, each block of quantized DCT coefficients in the 
signal from the variable-length decoder is dequantized by 
the dequantizer 703 using information extracted from the 
input signal VD3 by the variable length decoder 702. Each 
resulting block of DCT coefficients is then orthogonally 
transformed by the inverse DCT circuit 704, which prefer- 
ably applies an inverse DCT. The dequantizer 703 and the 
inverse DCT circuit 704 are constructed to have character- 
istics that are complementary to those of the quantizer 405 
and the DCT circuit 404, respectively, of the encoder shown 
in FIG. 12. 

Each macro block of motion prediction errors from the 
output of the DCT circuit 704 is fed to the adder 705 where 
it is combined with a macro block derived from one or more 
reference pictures by the predictor 711 to regenerate a macro 
block of the current picture. The resulting macro block of the 
current picture is fed into one of the field memories 707 
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through 710 in accordance with an address from the display 
address generation circuit 713. Fully reconstructed pictures 
stored in the field memories 707 through 710 are read out 
with appropriate timing by the display address generation 
circuit 713 to the selector 706, which provides the read out 5 
picture as part of the decoder output signal VOL 

The variable-length decoder 702 also extracts from head- 
ers in the input signal VD3 the various control signals 
described above which it feeds to the field address genera- 
tion circuit 721. When signal reproduced from the recording 10 
medium 109 was recorded using the first recording method, 
and the control signals indicate that a field was removed 
from the coder input signal, this causes the field address 
generation circuit 721 to read out once more from one of the 
field memories 707 through 710 the reconstructed picture 15 
that was read out two pictures earlier. The repeated read out 
picture is fed to the selector 706, which provides the read out 
picture as part of the decoding apparatus output signal VO. 
In this way, the control signal causes the decoder to repeat 
a decoded field to reconstitute each field that was removed 20 
from the coder input signal. 

The rate conversion circuit 113 of the second embodiment 
of the decoding apparatus shown in FIG. 9 will now be 
described with reference to FIG. 16. 

In the rate conversion circuit 113, the field address con- 25 
(roller 721 receives one or more control signals extracted 
from the input signal VD3 to the decoder 112 by the 
variable-length decoder 702, i.e., the field address controller 
receives either the FMC signal or the non_interlaced__ 
sequence flag, frame_rate, top_field_first flag, and num- 30 
ber__of_ field_displayed_code flag. The field address gen- 
erator 721 provides addresses to the selector 706 to cause the 
selector to feed the video signals of the reconstituted pic- 
tures stored in the field memory set 707 through 710 to the 
decoding apparatus output signal VO. 35 

The field address generator 721 also receives the tempo- 
ral_reference signal from the variable-rate decoder 702, 
which enables the field address generator 721 to control the 
selector 706 so that the order of the fields in the decoding 
apparatus output signal VO is the same as that of the coder 40 
input signal VI. 

When the signal recorded on the recording medium 109 
was recorded using the first variation of the first recording 
method described, the variable-rate decoder 702 extracts the 
field mode change signal FMC from the picture header and 45 
feeds it to the rate conversion circuit 113 as the control 
signal. For those frames for which the FMC signal is in its 
1 state, the field address-generator 721 causes the selector 
706 to feed the video signal of the first field of the frame 
from one of the field memories 707 through 710 to the 50 
decoder apparatus output signal a second time, so that the 
frame provides three fields of the decoder apparatus output 
signal VO. Otherwise, the field address generator causes the 
selector 706 to provide two fields of the decoding apparatus 
output signal VO from the frame. When the FMC of the first 55 
frame in a sequence is in the 0 state, two fields are derived 
from the frame as shown for the frame A in FIG. 17. But 
when the FMC of the first frame in the sequence is in the 1 
state, the three fields are derived from the frame, as shown 
for the frame B of FIG. 17. ^ 

When the signal recorded on the recording medium 109 is 
in accordance with the MPEG-2 standard, the signal may 
have been recorded using the second variation on the first 
recording method or using the second recording method. 
The variable-rate decoder 702 extracts the non_interlaced J 65 
sequence, the frame_rate, the top_field_first flag, the num- 
ber_of_field_displayed_code flag, and the temporal_ref- 
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erence from the picture header, and feeds these control 
signals to the field address generator 721. 

When the non_interlaced__sequence is 1 and the frame__ 
rate is 24 Hz or 23.976 Hz, this indicates that the signal 
recorded on the recording medium 109 was recorded using 
second variation of the first recording method. Accordingly, 
the field address generating circuit 721 examines the state of 
the top_field_first flag and the state of the number_of_ 
field_displayed_code flag to determine which field of 
which frame should duplicated in the decoding apparatus 
output signal VO. 

FIG. 18 illustrates how the two flag signals extracted from 
the picture header of each picture in the signal with the 24 
Hz frame rate reproduced from the recording medium 109 
control the generation of the decoding apparatus output 
signal with a 60 Hz field rate. When the top_field_first flag 
(DSO) extracted from the picture header is in its 1 state, and 
the number_of_field_displayed_code flag extracted from 
the picture header is in its 0 state, the field address generator 
721 causes the selector 706 to provide two fields of the 
decoding apparatus output signal VO from the picture signal 
following the picture header. The order in which the selector 
reads selected ones of the field memories 707 through 710 
is such that the first field of the output signal corresponds to 
the top field of the picture signal. 

When the top_field_first flag (DSO>is in its 1 state, and 
the number_of_field_dispIayed_code flag is in its 1 slate, 
the field address generator 721 causes the selector 706 to 
provide three fields of the decoding apparatus output signal 
VO from the picture signal following the picture header. The 
order in which the selector reads selected ones of the field 
memories 707 through 710 is such that the first and third 
fields of the output signal correspond to the top field of the 
picture signal. 

When the top_field_first flag (DSO) is in its 0 state, and 
the number_of_field_displayed._code flag is in its 0 state, 
the field address generator 721 causes the selector 706 to 
provide two fields of the decoding apparatus output signal 
VO from the picture signal following the picture header. 
However, the order in which the selector reads selected ones 
of the field memories 707 through 710 is such that the first 
field of the output signal corresponds to the bottom field of 
the picture signal. 

Finally, when the top_fieId_first flag (DSO) is in its 0 
state, and the number_of_field__displayed__code flag is in 
its 1 state, the field address generator 721 causes the selector 
706 to provide three fields of the decoding apparatus output 
signal VO from the picture signal following the picture 
header. The order in which the selector reads selected ones 
of the field memories 707 through 710 is such that the first 
and third fields of the output signal correspond to the bottom 
field of the picture signal. 

When the state of the non_interlace_sequence flag is 0 
and the frame_rate is 24 Hz or 23.976 Hz, this indicates to 
the field address generator 721 that the signal recorded on 
the recording medium 109 was recorded using the second 
recording method. In this instance, top__field_first flag 
(DSO) remains constantly in its 0 state, and the number^ 
of_field_dispIayed__code flag also remains constantly in its 
0 state. In response to this combination of control signals, 
the field address generator generates an address sequence 
that causes the selector 706 to perform 2-3 pull-down 
without reference to any control signals originating in the 
encoder. When the field address generator causes the selec- 
tor to feed one field of alternate frames to the decoding 
apparatus output signal VO to perform 2-3 pull-down, the 
duplicated fields in the decoding apparatus output signal VO 
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may or may not be the same fields as the duplicate fields in 
the coder input signal VI. 

Finally, the video signal recordings 108 or 109 made by 
or reproduced by the embodiments of the present invention 
at least include, as data relating to removal of a duplicated 5 
field, a field mode change signal (FMC) or telecine conver- 
sion rate information (non_interlaced__sequence and 
frame__rate). Pictures from which duplicated fields were 
removed may be identified using the field mode change 
signal or by the number_of_field__displayed_code flag. 
Identifying such pictures avoids the need to reduce the size 
of the VB V buffer to prevent overflow or underflow of the 
decoder buffer. 

The recordings may be made on such recording media as, 
for example, disk-shaped recording media (optical disks, 
recordable optical disks, hard disks and so forth), tape-based 15 
recording media, semiconductor memories, IC cards and so 
forth. Moreover, the signal generated by the coding appa- 
ratus may be transmitted as a broadcast signal, or via a 
distribution system such as a cable system or telephone 
network. 20 

We claim: 

1. A method for generating a coded video signal by coding 
an input video signal with a field rate of 60 Hz derived from 
a motion picture film source using 2-3 pulldown, the method 
comprising steps of: 25 

detecting duplicate fields in the input video signal; 
generating a control signal having a state indicating each 

of the detected duplicated fields; 
in response to the state of the control signal, eliminating 30 

from the input video signal the duplicate fields detected 

in the detecting step; 

generating a progressive video signal from the fields of 
the input video signal remaining following elimination 
of the duplicate fields in the eHminating step* the 35 
progressive video signal being composed of frames and 
having a frame rate of 24 Hz, the frames of the 
progressive video signal including lost-field frames 
generated from fields of the input video signal that were 
duplicates of the duplicate fields eliminated in the 40 
eliminating step; 

coding the frames of the progressive video signal to 
produce respective frames of the coded video signal, 
the frames of the progressive video signal being coded 
with different numbers of bits; and 4 5 

including the control signal in each of the frames of the 
coded video signal, the state of the control signal 
indicating the frames of the coded video signal result- 
ing from coding the lost-field frames of the progressive 
video signal. 

2. The method of claim 1, wherein the coding step 
comprises steps of: 

orthogonally transforming ones of the frames of the 
progressive video signal to produce respective sets of 
transform coefficients; 

locally decoding the sets of transform coefficients using 
an inverse orthogonal transform to produce locally- 
decoded picture; and 

applying predictive coding to ones of the frames of the 60 
progressive video signal using, for each of the ones of 
the frames of the progressive video signal, a selected 
one of the locally-decoded pictures as a reference 
picture. 

3. The method of claim 1, wherein the method is for 65 
providing a recording signal for recording on a recording 
medium and additionally comprises the steps of: 
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deriving a recording signal from the coded video signal; 
and 

recording the recording signal on the recording medium. 

4. The method of claim 1, wherein: 

the step of coding the frames of the progressive video 
signal to produce respective frames of the coded video 
signal includes a step of adding a picture header to each 
of the frames of the coded video signal; and 

in the step of including the control signal in each of the 
frames of the coded video signal, the control signal is 
included in the picture header of each of the frames. 

5. The method of claim 4, wherein, in the step of including 
the control signal in each of the frames of the coded video 
signal, in lieu of including the control signal in each of the 
frames of the coded video signal, the control signal is 
included in the picture header of only the frames resulting 
from coding the lost-field frames of the progressive video 



6. A method of decoding a coded video signal to provide 
an interlaced output video signal with a field rate of 60 Hz, 
the coded video signal including frames derived by coding 
respective frames of a progressive video signal having a 
frame rate of 24 Hz, the progressive video signal being 
derived from an interlaced input video signal with a field rate 
of 60 Hz by eliminating duplicate fields, the frames of the 
coded video signal including a control signal having a state 
indicating the frames of the coded video signal resulting 
from coding lost-field frames of the progressive video 
signal, lost-field frames being frames of the progressive 
video signal wherefrom a duplicate field was eliminated, the 
method comprising steps of: 

extracting the control signal from the frames of the coded 
video signal; 

decoding the frames of the coded video signal to provide 
respective frames of a reconstructed progressive video 
signal; and 

in response to the control signal extracted from each one 
of the frames of the coded video signal, deriving three 
fields of the interlaced output video signal from the 
respective one of the frames of the reconstructed pro- 
gressive video signal when the control signal indicates 
that the one of the frames of the coded video signal 
resulted from coding one of the lost-field frames, and 
deriving two fields of the interlaced output video signal 
from all others of the frames of the reconstructed 
progressive video signal. 

7. The method of claim 6, wherein: the control signal is 
included only in the frames of the coded video signal 
respectively derived from the lost-field frames of the pro- 
gressive video signal; and 

in the deriving step.three fields of the interlaced output 
video signal are derived from those frames of the 
reconstructed progressive video signal resulting from 
decoding the frames of the coded video signal where- 
from the control signal was extracted in the extracting 
step. 

8. The method of claim 6, wherein: 

frames of the coded video signal include respective sets of 

transform coefficients; and 
the decoding step includes the steps of: 
providing a reference picture, 
deriving the sets of transform coefficients from the 

coded video signal, 
applying an inverse orthogonal transform to the sets of 
transform coefficients to provide respective sets of 
motion prediction errors, and 
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reconstructing one of the frames of the reconstructed 
progressive video signal from the reference picture 
and one of the sets of motion prediction errors. 
9. Apparatus for generating a coded video signal by 
coding an input video signal with a field rate of 60 Hz 
derived from a motion picture film source using 2-3 pull- 
down, the apparatus comprising: 
detecting means for detecting duplicate fields in the input 
video signal; 

control signal generating means for generating a control 
signal having a state indicating each of the detected 
duplicated fields; 

eliminating means, operating in response to the state of 
the control signal, for eliminating the duplicate fields 
from the input video signal; 

generating means, operating on the input video signal 
following elimination of the duplicate fields by the 
eliminating means, for generating a progressive video 
signal composed of frames and having a frame rate of 
24 Hz, the frames of the progressive video signal 
including lost-field frames generated from fields of the 
input video signal that were duplicates of the duplicate 
fields eliminated by the eliminating means; and 

coding means for coding the frames of the progressive 
video signal to produce respective frames of the coded 
video signal; and 

multiplexing means for including the control signal in 
each of the frames of the coded video signal, the state 
of the control signal indicating the frames of the coded 30 
video signal resulting from coding the lost-field frames 
of the progressive video signal. 

10. The apparatus of claim 9, wherein the coding means 
comprises: 

transform means for orthogonally transforming frames of 35 
the progressive video signal to produce respective sets 
of transform coefficients; 

decoding means for locally decoding the sets of transform 
coefficients using an inverse orthogonal transform to 
produce locally-decoded pictures; and 

means for applying predictive coding to ones of the 
frames of the progressive video signal using, for each 
of the ones of the frames of the progressive video 
signal, a selected one of the locally-decoded pictures as 
a reference picture. 

11. The apparatus of claim 9, additionally for providing a 
recording signal for recording on a recording medium, and 
additionally comprising: 

means for deriving a recording signal from the coded 

video signal; and 
means for recording the recording signal on the recording 

medium. 

12. The apparatus of claim 9, wherein 

the coding means is additionally for adding a picture 
header to each of the frames of the coded video signal, 
and for including the control signal in the picture 
header of each of the frames. 

13. The apparatus of claim 12, wherein, in lieu of includ- 
ing the control signal in the picture header of each of the 
frames of the coded video signal, the coding means includes 
the control signal in the picture header of only the frames 
resulting from coding the lost-field frames of the progressive 
video signal. 

14. Apparatus for decoding a coded video signal to 
provide an interlaced output video signal with a field rate of 
60 Hz, the coded video signal including frames derived by 
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coding respective frames of a progressive video signal 
having a frame rate of 24 Hz, the progressive video signal 
being derived from an interlaced input video signal with a 
field rate of 60 Hz by eliminating duplicate fields, the frames 
of the coded video signal including a control signal having 
a state indicating the frames of the coded video signal 
resulting from coding lost-field frames of the progressive 
video signal, the lost-field frames being frames of the 
progressive video signal wherefrom a duplicate field was 
eliminated, the apparatus comprising: 
decoding means for decoding the frames of the coded 
video signal to provide respective frames of a recon- 
structed progressive video signal; 
extracting means for extracting the control signal from the 

frames of the coded video signal; and 
field deriving means, operating in response to the control 
signal extracted from each one of the frames of the 
coded video signal, for deriving three fields of the 
interlaced output video signal from the respective one 
of the frames of the reconstructed progressive video 
signal when the state of the control signal indicates that 
the one of the frames of the coded video signal resulted 
from coding one of the lost-field frames, and deriving 
two fields of the interlaced output video signal from all 
others of the frames of the reconstructed progressive 
video signal. 

15. The apparatus of claim 14, wherein: 

in lieu of including the control signal in each of the frames 
of the coded video signal, the control signal is included 
only in the frames of the coded video signal derived 
from lost-field frames; and 

the deriving means derives three fields of the interlaced 
output video signal from those frames of the recon- 
structed progressive video signal resulting from decod- 
ing the frames of the coded video signal wherefrom the 
control signal is extracted by the extracting means. 

16. The apparatus of claim 14, wherein: 

frames of the coded video signal include respective sets of 

transform coefficients; and 
the decoding means includes: 

means for deriving the sets of transform coefficients 
from the coded video signal, 

inverse transform means for applying an inverse 
onhogonal transform to the sets of transform coef- 
ficients to provide respective sets of motion predic- 
tion errors, and 

means for reconstructing each of the frames of the 
reconstructed progressive video signal from a refer- 
ence picture and a respective one of the sets of 
motion prediction errors provided by inverse trans- 
form means. 

17. System for deriving a recording signal for transfer to 
a medium, the recording signal being derived from an input 
video signal, and for deriving an output video signal from 
the recording signal reproduced from the medium, the 
recording signal having a constant bit rate substantially 
lower than the input video signal and the output video signal, 
the input video signal and the output video signal having a 
field rate of 60 Hz, the input video signal being derived from 
a motion picture film source using 2-3 pulldown, the system 
comprising: 

an encoding apparatus comprising: 

detecting means for detecting duplicate fields in the 

input video signal, 
control signal generating means for generating a con- 
trol signal having a state indicating each of the 
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detected duplicated fields, 

eliminating means, operating in response to the state of 
the control signal, for eliminating the duplicate fields 
from the input video signal, 

generating means, operating on the input video signal 5 
following elimination of the duplicate fields by the 
eliminating means, for generating a progressive 
video signal composed of frames and having a frame 
rate of 24 Hz, 

coding means for coding the frames of the progressive 
video signal to produce respective frames of the 
recording signal, and 

multiplexing means for including the control signal in 
each of the frames of the coded video signal, the state 
of the control signal indicating the frames of the 
coded video signal resulting from coding the lost- 15 
field frames of the progressive video signal; and 
a decoding apparatus comprising: 

decoding means for decoding the frames of the record- 
ing signal to provide respective frames of a recon- 
structed progressive video signal, 20 

extracting means for extracting the control signal from 
the frames of the recording signal, and 

field deriving means, operating in response to the 
control signal extracted from each one of the frames 
of the coded video signal, for deriving three fields of 25 
the interlaced output video signal from the respective 
one of the frames of the reconstructed progressive 
video signal when the state of the control signal 
indicates that the one of the frames of the coded 
video signal resulted from coding one of the lost- 30 
field frames, and for deriving two fields of the 
interlaced output video signal from all others of the 
frames of the reconstructed progressive video signal. 

18. The system of claim 17, wherein: 

in the encoding apparatus, the coding means comprises: 

transform means for orthogonally transforming frames 
of the progressive video signal to produce respective 
sets of transform coefficients, 

means for including the sets of transform coefficients in 4Q 
the recording signal, 

decoding means for locally decoding the sets of trans- 
form coefficients using an inverse orthogonal trans- 
form to produce respective locally-decoded pictures, 
and 45 

means for applying predictive coding to ones of the 
frames of the progressive video signal using, for 
each of the frames of the progressive video signal, a 
selected one of the locally-decoded pictures as a 
reference picture; and ^ 
the decoding means additionally includes: 

means for deriving the sets of transform coefficients 
from the recording signal, 

means for applying an inverse orthogonal transform to 
the sets of transform coefficients to provide respec- 55 
tive sets of motion prediction errors, and 

means for reconstructing respective frames of the 
reconstructed progressive video signal from the sets 
of motion prediction errors. 

19. The system of claim 17, wherein: 60 
in the encoding apparatus: 

in lieu of including the control signal in each of the 
frames of the video signal, the multiplexing means 
includes in the recording signal as a control signal 
vbv_buffer_size data indicating a size for a hypo- 65 
thetical video buffer verifier used by a control means 
in the coding means to control coding of the coded 
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progressive video signal, and 
the coding means comprises: 
transform means for orthogonally transforming the 

frames of the progressive video signal to produce 

respective sets of transform coefficients; 
quantizing means for quantizing the sets of transform 

coefficients to produce respective sets of quantized 

coefficients; and 
means for including the sets of quantized coefficients in 

the recording signal, and 
the control means in the coding means controls the 

quantizing means using a hypothetical video buffer 

verifier with the size thereof reduced to vbv_ 

bufFer__size xVs; and 
in the decoding apparatus: 
the extracting means is for extracting the vbv_buffer_ 

size data from the recording signal, and 
the decoding apparatus additionally includes an input 

buffer means for receiving the recorded signal, the 

buffer means having a size defined by the vbv_ 

buffer__size data from the extracting means. 

20. The system of claim 17, wherein: in the encoding 
apparatus, 

the coding means is additionally for adding a picture 
header to each of the frames of the recording signal, 
and for including the control signal in the picture 
header of each of the frames; and 

in the decoding apparatus, the extracting means is for 
extracting the control signal from the picture header of 
each of the frames of the recording signal. 

21. The system of claim 20, wherein: 

in the encoding apparatus, in lieu of including the control 
signal in the picture header of each of the frames of the 
recording signal, the coding means includes the control 
signal in the picture header of only the frames resulting 
from coding the lost-field frames; and 

in the decoding apparatus, the deriving means derives 
three fields of the interlaced output video signal from 
those frames of the reconstructed progressive video 
signal resulting from decoding the frames of the record- 
ing signal the picture header whereof includes the 
control signal. 

22. A method for generating, from an input video signal, 
a coded video signal for transfer to a medium at a constant 
bit rate, the input video signal having a field rate of 60 Hz 
and being derived from a motion picture film source using 
2-3 pulldown, the method comprising steps of: 

providing: 

a hypothetical video buffer verifier having a size 

defined by vbv_buffer_size, and 
an output buffer, 

detecting duplicate fields in the input video signal; 

eliminating from the input video signal the duplicate 
fields detected in the detecting step; 

generating a progressive video signal from the fields of 
the input video signal that remain following the elimi- 
nating step, the progressive video signal comprising 
plural frames with a frame rate of 24 Hz; 

coding the progressive video signal to produce the coded 
video signal, the frames of the progressive video signal 
being coded with different numbers of bits; 

temporarily storing the coded video signal in the output 
buffer prior to transferring the coded video signal to the 
medium; and 

controlling the coding step using the hypothetical video 



Exhibit 22, page 34 



31 



5,461,420 



buffer verifier with the size thereof reduced to vbv_ 
• buffer_size x*/5 to prevent input buffer overflow result- 
ing from a duplication of fields that occurs when the 
coded video signal is decoded to provide a decoded 
video signal having a field rate of 60 Hz. 5 

23. The method of claim 22, wherein: 
the coding step comprises steps of: 

orthogonally transforming the frames of the progres- 
sive video signal to produce respective sets of trans- 
form coefficients, io 

quantizing the sets of transform coefficients to produce 
respective sets of quantized coefficients, and 

temporarily storing the sets of quantized coefficients in 
the output buffer as part of the coded video signal; 
and 15 

in the step of controlling the coding step, the quantizing 
step is controlled using the hypothetical video buffer 
verifier with the size thereof reduced to vbv_buffer_ 
size xYs. 

24. Apparatus for generating, from an input video signal, 20 
a coded video signal for transfer to a medium at a constant 
bit rate, the input video signal having a field rate of 60 Hz 
and being derived from a motion picture film source using 
2-3 pulldown, the apparatus comprising: 

detecting means for detecting duplicate fields in the input 
video signal; 

eliminating means for eliminating from the input video 
signal the duplicate fields detected by the detecting 
means; 

generating means for generating a progressive video 
signal from the fields of the input video signal remain- 
ing following elimination of the duplicate fields by the 
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eliminating means, the progressive video signal being 
composed of frames and having a frame rate of 24 Hz*; 
coding means for coding the progressive video signal to 
produce the coded video signal, the frames of the 
progressive video signal being coded with different 
numbers of bits; 

output buffer means for temporarily storing the coded 
video signal prior to transfer of the coded video signal 
to the medium; and 

control means for controlling the coding means using the 
hypothetical video buffer verifier with the size thereof 
reduced to vbv_buffer_size x% to prevent input buffer 
overflow resulting from a duplication of fields that 
occurs when the coded video signal is decoded to 
provide a decoded video signal having a field rate of 60 
Hz. 

25w The apparatus of claim 24, wherein: 
the coding means comprises: 
transform means for orthogonally transforming the 
frames of the progressive video signal to produce 
respective sets of transform coefficients, 
quantizing means for quantizing the sets of transform 
coefficients to produce respective sets of quantized 
coefficients, 

means for temporarily storing the sets of quantized 
coefficients in the output buffer as part of the coded 
video signal; and 

the control means controls the quantizing means using the 
hypothetical video buffer verifier with the size thereof 
reduced to vbv_buffer_size x*A 
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